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-Abstract- 

In this paper we consider two metric covering/clnstering problems - Minimum Cost Covering 
Problem (MCC) and fc-clnstering. In the MCC problem, we are given two point sets X (clients) 
and Y (servers), and a metric on X U U. We wonld like to cover the clients by balls centered at 
the servers. The objective fnnction to minimize is the snm of the Q-th power of the radii of the 
balls. Here a > 1 is a parameter of the problem (bnt not of a problem instance). MCC is closely 
related to the fc-clustering problem. The main difference between fc-clustering and MCC is that 
in fc-clnstering one needs to select k balls to cover the clients. 

For any e > 0, we describe quasi-polynomial time (1 -|- e) approximation algorithms for both 
of the problems. However, in case of fc-clustering the algorithm uses (1 -f e)k balls. Prior to 
our work, a 3“ and a c“ approximation were achieved by polynomial-time algorithms for MCC 
and fc-clustering, respectively, where c > 1 is an absolute constant. These two problems are thus 
interesting examples of metric covering/clustering problems that admit (1 + £)-approximation 
(using (1 -|- e)k balls in case of fc-clustering), if one is willing to settle for quasi-polynomial time. 
In contrast, for the variant of MCC where a is part of the input, we show under standard 
assumptions that no polynomial time algorithm can achieve an approximation factor better than 
0(log |X|) for a > log |X|. 
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[T] Introduction 

We consider two metric covering/clustering problems. In the first problem, we are given two 
point sets X (clients) and Y (servers), and a metric d an X CY. For z a X CY and r > 0, 
the ball B{z,r) centered at z and having radius r > 0 is the set {y £ X Li Y\d{z,y) < r}. 
A cover for a subset P C X is a set of balls, each centered at a point of Y, whose union 
contains P. The cost of a set P = {Pi,..., P^} of balls, denoted by cost(P), is X]i=i 
where r{Bi) is the radius of P^, and a > I is a parameter of the problem (but not of a 
problem instance). The goal is to compute a minimum cost cover for the clients X. We refer 
to this problem as the Minimum Cost Covering Problem (MCC). 

In the second problem, we are given a set X of n points, a metric d on X, and a positive 
integer fc. Unlike in the case of MCC, here each ball is centered at a point in X} The cost 
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cost(S) of a set B of balls is defined exactly in the same way as in the case of MCC. The 
goal is to find a set B oi k balls whose union contains all the points in X and cost(B) is 
minimized. We refer to this problem as /c-clustering. 

Inspired by applications in wireless networks, MCC has been well studied [21]. One can 
consider the points in Y as the potential locations of mobile towers and the points in X 
as the locations of customers. A tower can be configured in a way so that it can serve the 
customers lying within a certain distance. But the service cost increases with the distance 
served. The goal is to serve all the customers minimizing the total cost. For modelling the 
energy needed for wireless transmission, it is common to consider the value of a to be at 
least 1. 

For the MCC problem with a = 1, a primal-dual algorithm of Charikar and Panigrahy 
[9] leads to an approximation guarantee of 3; their result generalizes to a > 1, with an 
approximation guarantee of 3“. The problem is known to be NP-hard for a > 1, even when 
X and Y are points in the Euclidean plane [2]. The case a = 1 has received particular 
attention. The first PTAS for the Euclidean plane was designed by Lev-Tov and Peleg 
[21]. Later, Gibson et. al [16] have designed a polynomial time exact algorithm for this 
problem when X and Y are points in the plane, and the underlying distance function d is 
either the li or loo metric. For the I 2 metric they also get an exact algorithm if one assumes 
two candidate solutions can be compared efficiently; without this assumption, they get a 
(1 -f e) approximation. Their algorithm is based on a separator theorem that, for any optimal 
solution, proves the existence of a balanced separator that intersects with at most 12 balls in 
the solution. In a different work they have also extended the exact algorithm to arbitrary 
metric spaces [15]. The running time is quasi-polynomial if the aspect ratio of the metric 
(ratio of maximum to minimum interpoint distance) is bounded by a polynomial in the 
number of points. When the aspect ratio is not polynomially bounded, they obtain a (1 -I- e) 
approximation in quasi-polynomial time. Their algorithms are based on a partitioning of the 
metric space that intersects a small number of balls in the optimal cover. 

When a > 1, the structure that holds for a = 1 breaks down. It is no longer the case, 
even in the Euclidean plane, that there is a good separator (or partition) that intersects 
a small number of balls in an optimal solution. In the case a = 2 and the Euclidean 
plane, the objective function models the total area of the served region, which arises in 
many practical applications. Hence this particular version has been studied in a series of 
works. Chuzhoy developed an unpublished 9-factor approximation algorithm for this version. 
Freund and Rawitz [14] present this algorithm and give a primal fitting interpretation of the 
approximation factor. Bilo et. al [8] have extended the techniques of Lev-Tov and Peleg [21] 
to get a PTAS that works for any a>l and for any fixed dimensional Euclidean space. The 
PTAS is based on a sophisticated use of the shifting strategy which is a popular technique in 
computational geometry for solving problems in [12, 18]. For general metrics, however, 
the best known approximation guarantee for a > 1 remains the already mentioned 3“ [9]. 

The fc-clustering problem has applications in many fields including Data Mining, Machine 
Learning and Image Processing. Over the years it has been studied extensively from both 
theoretical and practical perspectives [8, 9, 11, 15, 16, 22]. The problem can be seen as a 
variant of MCC where Y = X and at most k balls can be chosen to cover the points in X. 
As one might think, the constraint on the number of balls that can be used in fc-clustering 
makes it relatively harder than MCC. Thus all the hardness results for MCC also hold for 
fc-clustering. For a = 1, Charikar and Panigrahy [9] present a polynomial time algorithm with 
an approximation guarantee of about 3.504. Gibson et. al [15, 16] obtain the same results 
for fc-clustering with a = 1 as the ones described for MCC, both in and arbitrary metrics. 
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Recently, Salavatipour and Behsaz [7] have obtained a polynomial time exact algorithm for 
a = 1 and metrics of unweighted graphs, if we assume that no singleton clusters are allowed. 
However, in case of a > 1 the best known approximation factor (in polynomial time) for 
general metrics is c“, for some absolute constant c > 1; this follows from the analysis of 
Charikar and Panigrahy [9], who explicitly study only the case a = 1. In fact, no better 
polynomial time approximation is known even for the Euclidean plane. We note that though 
the polynomial time algorithm in [8] yields a (1 + e) approximation for fc-clustering in any 
fixed dimensional Euclidean space and for a > 1, it can use (1 + e)k balls. 

In addition to ^-clustering many other clustering problems (fc-means, fc-center, fc-median 
etc.) have been well studied [4, 10, 23, 17]. 

In this paper we address the following interesting question. Can the techniques employed 
by [8] for fixed dimensional Euclidean spaces be generalized to give (1 -I- e) approximation 
for MCC and fc-clustering in any metric space? Our motivation for studying the problems 
in a metric context is partly that it includes two geometric contexts: (a) high dimensional 
Euclidean spaces; and (b) shortest path distance metric in the presence of polyhedral obstacles 
in or K^. 


1.1 Our Results and Techniques 

In this paper we consider the metric MCC and fc-clustering with a > 1. For any e > 0, we 
design a (1 -f e)-factor approximation algorithm for MCC that runs in quasi-polynomial time, 
that is, in where c > 0 is a constant, m = |F|, and n = |W|. We also have 

designed a similar algorithm for fc-clustering that uses at most (1 + e)k balls and yields a 
solution whose cost is at most (1 -f e) times the cost of an optimal fc-clustering solution. The 
time complexity of the latter algorithm is also quasi-polynomial. As already noted, somewhat 
stronger guarantees are already known for the case a = 1 of these problems [15], but the 
structural properties that hold for a = 1 make it rather special. 

The results in this paper should be compared with the polynomial time algorithms [9] 
that guarantee 3“ approximation for MCC and c“ approximation for fc-clustering. The MCC 
and fc-clustering are thus interesting examples of metric covering/clustering problems that 
admit (1 + e)-approximation (using (1 + e)k balls in case of fc-clustering), if one is willing to 
settle for quasi-polynomial time. From this perspective, our results are surprising, as most of 
the problems in general metrics are APX-hard. The MCC and A:-clustering are also examples 
where the techniques used in fixed dimensional Euclidean spaces generalize nicely to metric 
spaces. This is in contrast to the facility location problem [3]. 

The algorithms that we have designed for both of the problems use similar techniques 
that exploit the following key property of optimal covers: there are only a “small” number 
of balls whose radius is “large”. We can therefore afford to guess these balls by an explicit 
enumeration. However, there can be a “large” number of balls with “small” radius. To help 
‘find’ these, we partition the metric space into blocks (or subsets) with at most half the 
original diameter, and recurse on each block. We have to pay a price for this recursion in the 
approximation guarantee. This price depends on the number of blocks in the partition that 
a small radius ball can intersect. (This is not an issue in the case a = 1, where each ball 
that is not guessed intersects precisely one of the blocks [15].) 

We are led to the following problem: is there a way to probabilistically partition a metric 
space into blocks of at most half the diameter, so that for any ball with “small” radius, the 
expected number of blocks that intersect the ball can be nicely bounded? The celebrated 
partitioning algorithms of Hartal [5] and Fakcharoenphol, Rao, and Talwar [13] guarantee 
that the probability that such a ball is intersected by two or more blocks is nicely bounded. 
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However, their bounds on the probability that a small ball is intersected do not directly 
imply a good bound on the expected number of blocks intersected by a small ball. Indeed, if 
one employs the partitioning algorithm of [13], the expected number of blocks intersected 
by a small ball can be quite “large”. Fortunately, the desired bound on the expectation 
can be shown to hold for the algorithm of Bartal [5] , even though he did not study the 
expectation itself. We use a similar partitioning scheme and derive the expectation bound in 
Section 2, using an analysis that closely tracks previous work [1, 6 , 19]. While the bound 
on the expectation is easily derived from previous work, our work is the first to study and 
fruitfully apply this bound. 

The algorithms for MCC and ^-clustering, which use the partitioning scheme of Section 2, 
are described in Section 3 and 4, respectively. In Section 5, we consider the approximability 
of a variant of the MCC where we allow a to be part of the input. For a > log jWj, we show, 
under standard complexity theoretic assumptions, that no polynomial (or quasi-polynomial) 
time algorithm for MCC can achieve an approximation factor better than 0(log jWj). This 
partly explains the dependence on a of the running time of our algorithms. 

The Partitioning Scheme 

Let Z be a point set with an associated metric d, let P C Z be a point set with at least 
2 points, and n > jPj be a parameter. For Q Q Z, denote the maximum interpoint 
distance (or diameter) of Q by diam(Q). Consider any partition of P into subsets (or blocks) 
{Pi, P 2 ,..., Pt}, where 2 < t < [PJ. Abusing notation, we will also view (Pi, P 2 ,.. ■, P*} 
as a sequence of blocks. We say that Pi non-terminally (resp. terminally) intersects a ball 
B if Pi intersects B and it is not (resp. it is) the last set in the sequence Pi, P 2 , ■ ■ ■, Pt 
that intersects B. We would like to find a partition jPi, P 2 ,..., Pt} of P that ensures the 
following properties: 

1. For each 1 <i <t, diam(Pi) < diam(P)/2. 

2. For any ball B (centered at some point in Z) of radius r < the expected size 

of the set {i\Pi C P 7 ^ 0} is at most 1 -|- logn, where c > 0 is a constant. In 

other words, the expected number of blocks in the partition that intersect B is at most 

l + CdlS^logn. 

3. For any ball B (centered at some point in Z) of radius r < 1 the expected number 

of blocks in the partition that non-terminally intersect B is at most log n, where 

c > 0 is a constant. 

We note that the second property follows from the third, as the number of blocks that 
intersect ball B is at most one more than the number of blocks that non-terminally intersect 
B. We will design a probabilistic partitioning algorithm that finds a partition with the 
desired properties. 

2.0.1 The Partitioning Scheme of [13] 

We first explain why the probabilistic partitioning algorithm of [13] does not achieve this 
guarantee. In this algorithm, we first pick a (3 uniformly at random from the interval [|, j], 
where <5 = diam(P). Also, let tti, 7 r2 ,..., TTp be a permutation of P chosen uniformly at 
random. We compute Pi,P 2 , ■ ■ ■ ,Pp in order as follows. Suppose we already computed 
Pi,...,Pi-i. We let 


Pi = {a; e P \ (Pi U P2 U • • • U Pi_i) : d{x,TTi) < /?}. 
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We will refer to Pi as tt^’s cluster. We return the partition {Pi \ Pi ^ 0}. 

Consider the following weighted tree. Let vertex u be connected to vertices ui,U 2 , ■ ■ ■ ,Ub 
using edges of weight ^ . Here, n, 6, and b are parameters. Let Vi,V 2 , ■ ■ ■ ,Vb be disjoint 
sets with b vertices each. For each i, m is connected to every vertex in Vi using edges of 
weight I — ^ . Finally, z is a new vertex that is connected to u using an edge of weight 

Consider the metric induced by this weighted graph, and let P denote the vertex set. 
That is P = {m} U U IJ^ Fi U {z}. Set n := |P| and note that b — Q{y/n). Also, 

S = diam(P). 

Let B = B{u^r) where r = Yeio^- Notice that the ball B consists of the points 
{u, ui,U 2 ,. ■ ■, Ub}. Consider running the probabilistic partitioning algorithm of [13], described 
above, on P. We argue that the expected number of blocks in the output partition that 
intersect B is which is asymptotically larger than 1 + c logn = 0(1). 

Fix 1 < i < b. We observe that in the output partition, Ui belongs to the cluster of some 
point in {u, ui, U 2 ,..., Mb} or of some point in Vi. Call Ui isolated if Ui belongs to the cluster 
of some point in Vi. If Ui is isolated, the cluster containing Ui does not contain any of the 
other Uj. Thus, the number of blocks of the output partition that intersect B is at least the 
number of vertices in {mi, M2, ..., Mbjthat are isolated. 

Note that Ui is isolated if the following two events occur: (a) P € [j — Yg^ogU’ll’ (^) 
some vertex in Vj appears in the random permutation tti , 7r2,..., tt^ before all vertices in 
{m, Ml,..., Mb}. The probability of these two events occuring is It follows that the 

expected number of isolated vertices, and thus the expected number of blocks in the partition 
that intersect B, is H(j^) = 

2.0.2 Probability Distribution 

Before describing our partitioning algorithm, we consider a probability distribution that it 
uses. Given a positive real 6 and an integer k > 2, the distribution is denoted by dist((5, k). 
The probability density function (pdf) / of dist(d, k) is the following: 


fix) = 


0 if x < S/8 and x > S/4 

s 


if| + (z-l) 


Slogfe — 


S 2 

8 log fc 2 ^ _ 

S k ^^4 Slogfe — — 4 


<x<l+i ^ 


8 log k 


for 1 < i < log fc — 1 


<X<°ii 


The following observation shows that / is indeed a density function. 

► Observation 2.1. / is a probability density function. 

Proof. It is sufficient to show that / satisfies the two properties of density function. As S 
and fc are nonnegative it is easy to see that f{x) > 0 for x G (—oo,+oo). Also, 


fix)dx = 


‘8 + siogfc 8 log A: 1 


" 8 + 8 log fc g log fc 1 






22 


dx 


5 I (logfc —1)(5 


8 log fc g log fc 1 


_ (log k — 2)5 

8 log k 




dx - 


8 log fc 2 
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^ logfc-2 ^ 


1 ^ 1 2 ^ ,1 2 ^ 
“2 ^ + V -1 

1=0 
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Consider the interval |]. Now divide the interval into log k subintervals of equal length. 
The interval for 1 < i < logfc — 1 is defined as [§ + (* — 1) giogfc ’ I + ^ slogfc )• 
interval is — g^ , j]. Denote the interval by Ij for 1 < j < log/c. 

To sample a (3 according dist(i5, k), we first pick one of these intervals from the distribution 
that assigns a probability of 1/2^ to Ij for 1 < j < log A: — 1, and a probability of 2/k to 
hogk- Having picked an interval Ij, we generate (3 uniformly at random from it. 

Now we discuss a vital property of the distribution dist((5, k) which we use in the analysis 
of the partitioning algorithm. For an event E, let Pr[i?] denotes the probability that E 
occurs. Now consider the following random process. We sample a value p from dist((^, k). 
Let Ej denotes the event that p G Ij. Then we have the following observation. 

► Observation 2.2. Pr[Ej\ — Pf[Ei\ for I < j < logk — 1. 

Proof. The proof follows from the definition of the pdf of dist((5, k). 


Pr[E,] 


8 8 log k 


1 + 0-1) 8ll 


Slog/c 1 , 1 12 2 1,^ 

-;- rdx = — = —r-1 = —r(l — 

d 21 20 23 k k 2E 
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1 
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1 

2logA;—1 
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Pr[ii)j_|_i] + ... + Pr[£’iog/c_i] + Pr[i?iogfc] — Pr[Ei] 

i=j + l 



2 

k 
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2.0.3 Partitioning Algorithm 

Now we move on to the partitioning algorithm, which we recall, is given Z, a metric d on Z, 
P C Z, and a parameter n > \P\. The procedure RAND-PARTITION(P) described below 
(as Algorithm 1) takes a point set P Q Z as input and outputs a partition with at most 
|P| subsets. Suppose that P = {pi,... ,P\p\}. The algorithm then generates Pi,P 2 , ■ ■ ■, P\p\ 
in order via the for loop. Suppose that Pi, P 2 , ■ ■ ■, Pi-i have already been constructed, 
and Q = P \ (Pi U P 2 U • • • U Pi-i). To construct Pi, the procedure samples a Pi from 
dist(diam(P), n). The choice of pi is done independently of the choices of the other points. 
Then Pi is set to {x G Q \ d{x,pi) < pi}. Note that this is done in the Pth iteration of the 
for loop. (Note that pi might not be assigned to Pi, as it could already be assigned to some 
other subset.) 

We show that RAND-PARTITION(P) satisfies the two guarantees mentioned before. 
To see that the first guarantee, note that the P values are chosen from the distribution 
dist(diam(P), n) which ensures that Pi < diam(P)/4 for 1 < i < p. Now each point in a subset 
Pi is at a distance at most Pi from pi. Thus by triangle inequality diam(Pi) < diam(P)/2. 
In the next lemma we will show that the second guarantee also holds. Before that we have a 
definition. 

We say that Pi non-terminally (resp. terminally) intersects B{y,r) if Pi intersects B{y,r) 
and it is not (resp. it is) the last set in the sequence Pi, P 2 ,..., Pp that intersects B{y, r). 
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Algorithm 1 RAND-PARTITION(P) 

Require: A subset P = {pi,... ,p\p\} C Z 
Ensure: A partition of P 
1 : |P| 

2: Q^P 

3: for i = 1 to p do 

4: sample a Pi from dist(diam(P), n) corresponding to pi 

5: Pi ^ {x e Q\d{x,pi) < p^} 

6 : Q Q \ Pi 

7: return {PpPi ^ 0 and 1 < * * < p} 


► Lemma 1. (Partitioning Lemma) There is a constant c such that for any ball B{y,r) with 
r < ; the expected number of blocks in the output of RAND-PARTITION(P) that 

intersect B(y,r) is at most 1 + c logn. Moreover, the expected number of blocks that 

intersect the ball non-terminally is at most logn. 

The intuition for the lemma is as follows. Consider the beginning of the i’th iteration of 
the for loop and assume that ball B{y, r) is not fully contained in the union of the previously 
constructed blocks Pi,..., Pi-i- Then, considering the choice of Pi, the probability that 
B{pi,Pi) fully contains the ball B{y,r) is nearly as large as the probability that B{pi,Pi) 
intersects B{y,r). If B{pi,Pi) fully contains the ball B{y,r), then of course none of the 
blocks Pipi,Pip 2 , ■ ■ ■ ,Pp intersect B{y, r). We now proceed to the proof. 

Proof. For a point x G Z and subset Q C Z, let dniin(a:, Q) = min^gg d{x, q) and dmaxix, Q) = 
uiSLXq^Q d{x,q). Fix the ball B{y,r) with r < For each 1 < i < p, consider the 

indicator random variable Ti defined as follows: 

j, _ ( ^ Pi intersects B{y,r) 

* \ 0 otherwise 

Let the random variable T = 'YTi=i Pi t)e the number of subsets that the ball intersects. 
Then E[T] = E[T,] = intersects B{y,r)]. 

Clearly, there is at most one Pi that is the last one that intersects B{y,r). Thus, 

p p 

''^^Pr[Pi intersects B{y,r)] < l + non-terminally intersects B{y,r)]. 

i=l i=l 

Let Xi = dniin{Pi,B{y, r)) and yi = dmax(Pi, B(y, r)). By the triangle inequality, yi-Xi< 
2r. Denote by (5'*) the event that Pi lands in the interval [xi,yi]. Note that for Pi to non- 
terminally intersect B(y,r), the event (5"*) must occur. Thus, if the interval [xi,yi\ does not 
intersect the interval diam(p) then Pr[Pi non-terminally intersects i?(?/,r)] = 0. 

We therefore turn to the case where [xi,yi\ does intersect the interval diam(P) 

Recall that in defining the probability distribution dist((ijam(P),n), we have divided the 
latter interval into log n subintervals Ii,l 2 , ■ ■ ■, I\ogn of equal length. Denote by a/ the 
probability 

Pr[ a random sample drawn from dist(diam(P), n) belongs to J/]. 

For convenience, define Iiogn+i = [diam(P)/4,oo) and oiogn+i = 0. 
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Let be the subinterval that contains Xi. (In case Xi < ^ let k = 1.) The length 

of [xi,yi] is at most 2r, 2r < diam(P)/81ogn, and the length of each of the subintervals 
is diam(P)/8logn. Thus [xi^Ui] can intersect at most one more subinterval, and this is 
Let ri and r 2 be the length of Ii. fl [xi,yi] and I 1-+1 n [xi,yi] respectively. Note that 
ri+r 2 <yi- x^< 2r. 

To bound Pr[Pi non-terminally intersects B{y, r)], we now have two cases. We say that pi 
is far (from the ball B{y, r)) if k S {log n—1, log n}. We say that pi is near if I < k < log n—2. 

Case 1: pi is far. In this case ai-,ai.+i < Thus 


Pr[S'*] < Pr[l3^ lands in n [xi, yf\] + Pr[l5^ lands in /;,+i n [xi,yj\] 

~ diam(P)/81ogn°^*’ diam(P)/8 log 

^ 2r 2 

“ diam(P)/81ogn n 
32r log n 
n ■ diam(P) 


Thus, Pr\Pi non-terminally intersects B{y^r)] < Pr[S'^] < 


32r log n 
n-diam(P) ' 


Case 2: pi is near. For such a Pi we have the following crucial observation. 


► Claim 2.1. Pr[Pi non-terminally intersects B{y,r)] < 


dilm(p ) terminally intersects B{y,r)]. 


Proof. Suppose that Pi, P 2 , ■ ■ ■, Pi-i have been chosen and B{y,r) C Pi U P 2 U • • • U 
Pi-i- Conditioned on such a history, we have Pr[Pi non-terminally intersects P(y,r)] = 
Pr[Pi terminally intersects B{y,r)] = 0 and the claimed inequality holds. 

Now suppose that B{y, r) \ [Pi U P 2 U • • • U Pi_i) 7 ^ 0. Let us condition on such a history. 
Then, Pi terminally intersects B[y,r) if j3i lands in I 1-+2 U Ii ,+3 U • • • U /logn- Thus, using 
Observation 2.2, 


Pr[Pi terminally intersects B[y, r)] > ai ^+2 + ai.+z -I- • • • -I- aiog„ = ai^+i. 
On the other hand. 


Pr\Pi non-terminally intersects B[y,r)] 


< Pr[S^ I A e Ih U lu+i] ■ Pr[P^ G U h^+i] 


< 


< 


< 


ri 


+ 


1 


r2 


3 diam(P)/81ogn 3 diam(P)/8 log n 

32 • r log n 


[aii + o^ii+i) 


3 • diam(P) 
32 • r log n 
diam(P) 


• 30;.+1 

Pr[Pi terminally intersects B[y,r)]. 
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Hence the expected number of subsets that intersect B(y,r) non-terminally is 


< 


< 


< 


< 


'^^Pr[Pi non-terminally intersects i3(y,r)] 

i=l 

Pr[Pi non-terminally intersects B{y^r)] 

i:pi is far 

+ PT[Pi non-terminally intersects B{y,r)] 


i:pi is near 

32r log n ^ 1 
diam(P) n 


32r log n 
diam(P) 


p 

y^Pr[Pi terminally intersects B(y,r)] 

i=l 


For the last inequality, we used the fact that terminally intersects B{y,r)] = 1, 

since there is exactly one Pi that terminally intersects P(y, r). Putting the two cases together, 
we have 


E[T] < 1 -h c 


diam(P) 


logn. 


◄ 


We conclude by summarizing the result. 

► Theorem 2. Let Z be a point set with an associated metric d, let P C Z be a point set 
with at least 2 points, and n > |P| be a parameter. There is a polynomial-time probabilistic 
algorithm RAND-PARTITION{P) that partitions P into blocks {Pi, P 2 ,..., Pt} and has the 
following guarantees: 

1 . For each 1 <i <t, diam{Pi) < diam{P)/2. 

2 . There is a constant c > 0 so that for any ball B (centered at some point in Z) of radius 

r < , the expected size of the set {i\Pi H P 7 ^ 0} is at most 1 logn and 

the expected number of blocks that non-terminally intersect B is at most log n. 

I 3 I Algorithm for MCC 

We now describe our (1 -|- e)-factor approximation algorithm for the MCC problem. Recall 
that we are given a set X of clients, a set Y of servers, and a metric d on X UY. We wish to 
compute a cover for X with minimum cost. Let m = \Y\ and n = \X\. 

For PCX, let opt(P) denote some optimal cover for P. Denote by cost(P) the cost of a 
ball B (the a-th power of B’s radius) and by cost(P) the cost X]B 6 BCOst(P) of a set B of 
balls. 

To compute a cover for P, our algorithm first guesses the set Q C opt(P) consisting of 
all the large balls in opt(P). As we note in the structure lemma below, we may assume that 
the number of large balls in opt(P) is small. We then use the algorithm of Theorem 2 to 
partition P into jPi, P 2 ,..., Pt}. For each 1 < i < t, we recursively compute a cover for the 
set P/ C Pt of points not covered by Q. 

To obtain an approximation guarantee for this algorithm, we use the guarantees of 
Theorem 2. With this overview, we proceed to the structure lemma and a complete description 
of the algorithm. 
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3.1 A Structure Lemma 

It is not hard to show that for any 7 > 1 and PCX such that diam(P) is at least a constant 
factor of diam(X U Y), opt(P) contains at most ( 0 / 7 )“ balls of radius at least diam(P)/ 7 . 
Here c is some absolute constant. The following structural lemma extends this fact. 

► Lemma 3. Let P C X, 0 < A < 1 and 7 > 1, and suppose that opt{P) does not eontain 
any ball of radius greater than or equal to 2a ■ diam(P)/X. Then the number of balls in 
opt{P) of radius greater than or equal to diam(P)/j is at most c(A, 7 ) ;= ( 9 a 7 /A)“. 

Proof. Suppose that opt(P) does not contain any ball of radius greater than or equal 
to 2a ■ diam(P)/A. Note that each ball in opt(P) intersects P and has radius at most 
2a ■ diam(P)/A. Thus the point set {z G AT U F | z G P for some B G opt(P)} has diameter 
at most diam(P) + 8 a • diam(P) jX <Qa ■ diam(P)/A. It follows that there is a ball centered 
at a point in Y, with radius at most 9a • diam(P)/A that contains P. 

Let t denote the number of balls in opt(P) of radius greater than or equal to diam(P)/ 7 . 
By optimality of opt(P), we have t ■ (diam(P)/ 7 )“ < (9a • diam(P)/A)“. Thus t < ( 9 a 7 /A)“. 


3.2 The Algorithm 

We may assume that the minimum distance between two points in Af is 1. Let L = 
1 + log(diam(Ai)). As we want a (1 + £:)-approximation, we fix a parameter A = £/2P. Let 
7 = where c is the constant in Theorem 2. Denote V to be the set of balls such that 

each ball is centered at a point of y G Y and has radius r = d{x, y) for some x € X. We note 
that for any PCX, any ball in opt(P) must belong to this set. Note that \T>\ < mn. Recall 
that c(A, 7 ) = ( 9 a 7 /A)“. 

With this terminology, the procedure POINT-COVER(P) described as Algorithm 2 
returns a cover of P C A. If |P| is smaller than some constant, then the procedure returns an 
optimal solution by searching all covers with a constant number of balls. In the general case, 
one candidate solution is the best single ball solution. For the other candidate solutions, the 
procedure first computes a partition {Pi,..., P.^} of P, using the RAND-PARTITION(P) 
procedure. Here RAND-PARTITION(P) is called with Z = XCY and n = \X\ > |P|. Then 
it iterates over all possible subsets of V of size at most c(A, 7 ) containing balls of radius 
greater than diam(P)/ 7 . For each such subset Q and 1 < i < r, it computes the set P{ C P^ 
of points not covered by Q. It then makes recursive calls and generates the candidate solution 
P0INT-C0VFR(P/). Note that all the candidate solutions are actually valid covers 
for P. Among these candidate solutions the algorithm returns the best solution. 

Our overall algorithm for MCC calls the procedure POINT-COVFR(A) to get a cover of 
A. 

3.3 Approximation Guarantee 

For PCX, let level(P) denote the smallest non-negative integer i such that diam(P) < 2L 
As the minimum interpoint distance in A is 1, level(P) = 0 if and only if |P| < 1. Note that 
level(A) < L. 

The following lemma bounds the quality of the approximation of our algorithm. 

► Lemma 4. POINT-COVER{P) returns a solution whose expeeted eost is at most (1 -|- 
Xy cost{opt{P)), where I = level{P). 
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Algorithm 2 POINT-COVER(P) 

Require: A subset PCX. 

Ensure: A cover of the points in P. 

1 : if |P| is smaller than some constant k then 

2 : return a minimum solution by checking all covers with at most k balls. 

3: sol the best cover with one ball 
4: cost ^ cost{sol) 

5: Let {Pi,..., Pt} be the set of nonempty subsets returned by RAND-PARTITION(P) 
6 : Let B be the set of balls in V having radius greater than 
7: for each Q C B of size at most c(A, 7 ) do 
8 : for z = 1 to T do 

9: Let Pl = [pG Pi\p^ UbsQ 

10: O' ^ QU UiLi POINT-COVER(P’') 

11: if cost(Q') < cost then 

12 : cost cost(Q') 

13: sol ^ Q' 

14: return sol 


Proof. We prove this lemma using induction on 1. If Z = 0, then |P| < 1 and POINT- 
COVER(P) returns an optimal solution, whose cost is cost(opt(P)). Thus assume that I > 1 
and the statement is true for subsets having level at most l — l. Let P C A be a point set with 
level(P) = 1. If |P| is smaller than the constant threshold k, POINT-COVER(P) returns an 
optimal solution. So we may assume that |P| is larger than this threshold. We have two cases. 

Case 1: There is some ball in opt(P) whose radius is at least 2a • diam(P)/A. Let B denote 
such a ball and r(P) > 2a •diam(P)/A be its radius. Since (l-f A/2a)r(P) > r(P)-|-diam(P), 
the concentric ball of radius (1 -I- A/2a)r(P) contains P. It follows that there is a cover for 
P that consists of a single ball and has cost at most 

(1 -I- A/2a)“r(P)“ < (1 -I- A)cost(opt(P)) < (1 -I- A)*cost(opt(P)). 

Case 2: There is no ball in opt(P) whose radius is at least 2a • diam(P)/A. Let Qq C opt(P) 
contain those balls of radius at least diam(P)/ 7 . It follows from Lemma 3 that |Qo| < c(A, 7 ). 
Thus the algorithm considers a Q with Q = Qq. Fix this iteration. Also fix the partition 
{Pi,..., Pr} of P computed by RAND-PARTITION(P). RAND-PARTITION ensures that 
diam(Pi) < diam(P)/2 for 1 < i < r. Thus diam(P/) < diam(P)/2 and the level of each P{ 
is at most l—l. Hence by induction the expected value of cost(POINT-COVER(P/)) is at 
most (1 -I- A)^“^cost(opt(P/)). 

T 

Let S' = opt(P) \ Qo- We argue below that the expected value of X) cost(opt(P/)) is at 

Z —1 

most (1 -I- A)cost(5'). Assuming this, we have 


P[cost(Qo U y POINT-COVER(P;))] 


T 

< cost(Qo) + (1 + A)^~^P[y^cost(opt(P{))] 

i=l 

< cost(Qo) + (1 + A)^cost(5') 

< (1-I-A)^cost(opt(P)). 
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Thus POINT-COVER(P) returns a solution whose expected cost is at most (l+A)^cost(opt(P)), 
as desired. 

We now argue that the expected value of X)I=i cost(opt(P/)) is at most (1 + A)cost(5'). 
Let Bi consist of those balls in S' that intersect Pi. For B G S' , let denote the number 
of blocks in the partition {Pi,..., Pr} that B intersects. Because Bi is a cover for P/, we 
have cost(opt(P{)) < cost(Pi). Thus 

r T 

^cost(opt(Pj')) < ^cost(Pj) = ^ fi{B)cost{B). 

2=1 2=1 B^S' 


By definition of Qq, any ball B G S' = opt(P)\ Qo has radius at most ^ ^ 

where c is the constant in Theorem 2. We may assume that c > 16 and hence ^ e — 
• Theorem 2 now implies that 


p[Ms)] < 1 + 


c • r{B) logn 
diam(P) 


< 1 + 


clogn 

diam(P) 


A • diam(P) 
clog n 


1 + A. 


Thus the expected value of cost (opt (P/)) is at most 


P[/r(P)]cost(P) < (1 + A) cost(P) = (1 + A)cost(5'), 
BeS' Bes' 


as claimed. 


•4 


We conclude that the expected cost of the cover returned by POINT-COVER(X) is at 
most (1 + A)^cost(opt(X)) < (1 + e)cost(opt(X)), since A = e/2L. 

Now consider the time complexity of the algorithm. POINT-COVER(P) makes 
direct recursive calls on subsets of diameter at most diam(P)/2. Thus the overall time com¬ 
plexity of P0INT-C0VER(A:) can be bounded by Plugging in A = e/2L, 

7 = clogn/A, and c(A,7) = (9a7/A)“, we conclude 

► Theorem 5. There is an algorithm for MCC that runs in time (mn) ^ ^ ' and 

returns a cover whose expected cost is at most (1 -be) times the optimal. Here L is 1 plus the 
logarithm of the aspect ratio of X, that is, the ratio of the maximum and minimum interpoint 
distances in the client set X. 


Using relatively standard techniques, which we omit here, we can pre-process the input 
to ensure that the ratio of the maximum and minimum interpoint distances in X is upper 
bounded by a polynomial in However, this affects the optimal solution by a factor of 
at most (1 -b e). After this pre-processing, we have L = 0(log Using the algorithm 
in Theorem 5 after the pre-processing, we obtain a (1 -b e) approximation with the quasi¬ 
polynomial running time 0(2'°®'^* mny jjgj-g ^]^g 0(1) hides a constant that depends on a 
and e. 


Algorithm for fc-clustering 

Recall that in ^-clustering we are given a set X of points, a metric d on X, and a positive 
integer k. Let |A| = n. For PCX and integer k > 0, let opt(P, k) denote an optimal 
solution of K-clustering for P (using balls whose center can be any point in A). We reuse the 
notions of level(P), cost(P) and cost(P) from Section 3, for a point set P, a ball B, and a 
set B of balls, respectively. Denote V to be the set of balls such that each ball is centered at 
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a point oi y X and has radius r = d{x, y) for some x G X. We note that for any PCX, 
any ball in opt(P, k) must belong to this set. Note that |I?| < 

To start with we prove a structure lemma for fc-clustering. 

► Lemma 6. Let P C X, a be a positive integer, and 7 > 1. Then the number of balls in 
opt{P,K) of radius greater than or equal to diam{P)/j is at most 0(7) := 7“. 

Proof. Note that any ball centered at a point in P and having radius diam(P) contains all 
the points of P. Now by definition of diam(P) and T>, there is a point x G P such that the 
ball P(a::,diam(P)) e T). Hence opt(P, k) < diam(P)“. 

Let t denote the number of balls in opt(P, k) of radius greater than or equal to diam(P)/y. 
By optimality of opt(P, k), we have t ■ (diam(P)/7)“ < diam(P)“. Thus t < 7“. ◄ 

Like in the case of MCC, we assume that the minimum distance between two points in 
X is 1. Let L = 1 + log(diam(X)). We fix a parameter A = ef6L. Let 7 = where c is 

the constant in Theorem 2. 

We design a procedure CLUSTERING(P, k) (see Algorithm 3) that given a subset P 
of X and an integer k, returns a set of at most (1 + 3A)*«; balls whose union contains P, 
where I = level(P). We overview this procedure, focussing on the differences from the 
procedure POINT-COVER() used to solve the MCC problem. In CLUSTERING(P, k), 
RAND-PARTITION(P) is called with Z = X and n = |X| > |P|. We require two properties 
of the partition {Pi,..., P,-} of P computed by RAND-PARTITION(P). Let Qq be the 
set containing the large balls of opt(P,/c), that is, those with radius at least diam(P)/7. 
Let S' — opt(P, k) \ Qo denote the set of small balls, and let S'i C S' consist of those 
balls that contain at least one point in Pi that is not covered by Qo- We would like (a) 
X]r=i cost(5'i) < (1 + 3A)cost(5'), and (b) ^ (1 + 3A)|5'|. Theorem 2 ensures 

that each of (a) and (b) holds in expectation. However, we would like both (a) and (b) to 
hold simultaneously, not just in expectation. For this reason, we try 0(logn) independent 
random partitions in Line 6, ensuring that with high probability, properties (a) and (b) hold 
for at least one of them. 

Now let us fix one of these 0(log n) trials where we got a partition (Pi,..., P^} satisfying 
properties (a) and (b), and also fix an iteration in Line 9 where we have Q = Qq. Let 
P/ ^ Pi be the points not covered by Qq. For each 1 < i < r and 0 < < (1 + 3 A)k, we set 
cluster(P/, Ki) to be the cover obtained by recursively invoking CLUSTERING(P{, Ki) (as 
in Line 13). 

Let us call a tuple (ki, K 2 , ■ ■ ■ ,Kr) of integers valid if 0 < < (1 + 3A)(k — |Qo|) and 

X)r=i < (1 + 3 A)(k — |Qo|). We would like to minimize X)r=i cost(cluster(P/, m)) over all 
valid tuples (ki, Av 2, ..., Kr)- As there are too many valid tuples to allow explicit enumeration, 
we solve this optimization problem in Lines 14-21 via a dynamic programming approach. 

This completes our overview. Our overall algorithm for fc-clustering calls the procedure 
CLUSTERING(A, k). Next we give the approximation bound on the cost of the solution 
returned by CLUSTERING(P, k). 

► Lemma 7. For any PCX and an integer k > 1, CLUSTERING{P, k) returns a solution 

consisting of at most (1 + 3A)*k balls and with probability at least 1 — , the cost of the 

solution is at most (1 + SXf cost{opt{P, k)), where I = level{P). 

Proof. We prove this lemma using induction on 1. If Z = 0, then |P| < 1 and CLUSTERING(P, 
k) returns an optimal solution, whose cost is cost (opt (P, k)). Thus assume that I > 1 and 
the statement is true for subsets having level at most l — l. If |P| is smaller than the constant 
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Algorithm 3 CLUSTERING (P, k) 

Require: A subset P C X, an integer k. 

Ensure: A set of balls whose union contains the points in P. 

1 : if |P| is smaller than some constant /3 then 

2 : return a minimum solution by checking all solutions with at most min{K,/3} balls. 

3: sol the best solution with one ball 
4: cost ^ cost{sol) 

5: I -ir- level(P) 

6 : for all 2 log 3/2 n iterations do 

7: Let {Pi,..., Pr} be the set of nonempty subsets returned by RAND-PARTITION(P) 

8 : Let B be the set of balls in V having radius greater than 

9: for each Q C B of size at most 0 ( 7 ) do 

10: for z = 1 to T do 

11: Bet Pi = {pG Pi \ p^Ub^qB} 

12 : for each 1 < z < r and 0 < ki < (1+3A)k do 

13: cluster(L{', m) e- CLUSTERING(P/, m) 

14: for z = 0 to r — 1 do 

15: Ri ■<— ljj=j_|_i Pj 

16: for /Cl = 0 to (l+3A)/c do 

17: cluster(PT-_i, Ki) ^ cluster(P{, Ki) 

18: for all z = r — 2 to 0 and 0 < /ci < (l+3A)/c do 

19: ^ ™’^k': 0 <k'<ki cost(cluster(P/^i,/ c') U cluster(Pi_|_i, Ki — k')) 

20: cluster(Pi, /Cl) ^ cruster(i{'+ 3 , /cU„) U cluster(Pi+i, /ci - /cU„) 

21 : Q' ^ Q U cluster(Po, (1 + 3A) • (/c - |Q|)) 

22 : if \Q'\ < (l+3A)^/c and cost(Q') < cost then 

23: cost ^ cost(Q') 

24: sol ^ Q' 

25: return sol 


threshold fi, CLUSTERING(P, /c) returns an optimal solution. So we may assume that |P| 
is larger than this threshold. 

Consider one of the 21og3/2fi iterations of CLUSTERING(P, /c). Fix the partition 
{Pi,..., Pr} of P computed by RAND-PARTITION(P) in this iteration. Let Qq be the set 
containing the balls of opt(P, /c) with radius at least diam(P)/7. It follows from Lemma 6 
that |Qo| < c(7). Fix the choice Q—Qq. 

RAND-PARTITION(P) ensures that diam(Pi) < diam(P)/2 for 1 < z < r. Thus 
diam(P/) < diam(P)/2 and the level of each P{ is at most I — 1. Hence by induction, 
cluster(P{,/Cl) contains at most (l+3A)^“^/ci balls and with probability at least 1 — 
its cost is at most (1 + 3A)*“^cost(opt(P{, /ci)) for 1 < /ci < (1 + 3A)/c. 

Let S' = opt(P, k) \ Qo- Thus |5'| < /c — |Qo|- We note that the union of the balls in S' 
contains the points in U}^iP/. Let S't C S' be the set of balls that intersect with P/ and 
thus the union of balls in S'i contains Pi, where 1 < z < r. We argue below that the value 
of X]I=i is sit most (1 + 3A)|5'| and the value of J2i=i cost(opt(P/, |5'i|)) is at most 
(1 + 3A)cost(5') with probability at least 1/3. Then the probability is at least 1 — that 
corresponding to one of the 21og3/2« iterations the value of J2i=i is at most (1 + 3A)|5'| 
and the value of cost(opt(P/, |5'i|)) is at most (1 + 3A)cost(5'). We refer to this event 
as the first good event. Now let us assume that the first good event occurs. 
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Note that by induction, cluster(P/, |5'i|) contains at most (1 + 3A)* balls and 

with probability at least 1 — , its cost is at most (1 + 3A)*“^cost(opt(P/, |5'i|)) for 

1 < i < T. The probability that for every 1 < i < r, the cost of cluster(P/, |5'i|) is at most 
(1 + 3A)*“^cost(opt(P/, |5'i|)) and it contains at most (1 + 3A)^“^|5'i| balls, is at least 





2=1 


\Pr\ - 1 

rr,2 


> 1 - 


|P|-2 


We refer to this event as the second good event. Assuming this second good event occurs, 
it follows by an easy induction on i that cluster(Pi, J2'j=i+i covers .Ri=Uj=i+i Pj 
most (1+3A)^“^ X]J=i+i \^'j\ balls and its cost is at most (1+3A)^“^ Sj=i+i cost(opt(Pj, |5'j|)). 
Thus cluster(Po, covers Rq = UlLi Pi with at most (1 + 3A)^“^ Yl'i=i balls 

and its cost is at most (1+3A)^“^ Si=i cost(opt(P,., |5'j|)). Hence QqU cluster(Po, Yl'i=i 
covers P. Now assuming both the good events occur, we have 


T 

|Qo U cluster(Po, |>g'i|)| 

2=1 

< 

T 

2o +^(1+3A)^ ^\S'i\ 

2=1 


< 

T 

Qo + (1+3A)* ^ tS'd 

2=1 


< 

|Qo| + (l+3A)d5l 


< 

|Qo| + (1+3A)*(k — |Qol) 


< 

(l+3A)'/c. 

r 

cost(Qo U cluster(Po, 5'. 

2=1 

d)) 

T 

< cost(Qo) + (1+3A)^~^ y^cost(opt(P{, 5'/)) 

2=1 



< cost(Qo) + (l+3A)*cost(5') 



< (l+3A)*cost(opt(P, k)). 


The probability that both the good events occur is at least 


(1 




1 ^ 1-1 
r^2 


Hence the statement of the lemma follows by noting that the value of ici 

cluster(Po, 1^1=1 is at most (1 + 3A) • (k - |Qo|)- 

We now argue that the probability that is greater than (1 + 3A)|5'| is at most 

1/3 and the probability that cost(opt(P/, |5'i|)) is greater than (1 + 3A)cost(5') is at 
most 1/3. Then using union bound, the probability that J2i=i is at most (1 + 3A)|5'| 
and cost(opt(P/, |5'i|)) is at most (1 + 3A)cost(5') is at least 1/3. 

For B G S', let fi{B) denote the number of blocks in the partition {Pi,..., Pr} that 
non-terminally intersect B. We note that at most one block can terminally intersect B. 
Because S'i is a cover for P/ with |5'i| balls, we have cost(opt(P/, |5'i|)) < cost(5'i). Thus 


^ cost(opt(P/, |5'i|)) < ^cost(5'i) = ^ (l+^(P))cost(P) = cost(5')+ ^ /r(P)cost(P). 
i=l i=l BgS' BgS' 


and. 


2 = 1 BGS' 


|5'| + 5] /r(P). 
BeS' 
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By definition of Qq, any ball B £ S' = opt(P, k) \ Qq has radius at most = 

A diam(p) g jg ^.j^g constant in Theorem 2. We may assume that c > 16 and hence 

^ Sw- Theorem 2 now implies that 

pr , nsn , c-r{B)logn clogn A ■ diam(P) _ 

^ ~ diam(P) ~ diam(P) clogn 

Using linearity of expectation, 

EIY, KB)]= Y. E[f^{B)]<\-\S'\. 

BeS' BeS' 

Now by Markov’s inequality, 

Pr[Y MS)>3A-|5'|]<1/3. 

BeS' 


It follows that, 

r 

Pr[Y |5'i| > (1 + 3A)|5'|] 

< Pr[\S'\+Yt^iB)>{l + 3X)\S'\] 

BeS' 

< PrlY ^^{B)>^^■\S'\]<^/3. 

BeS' 

Again using linearity of expectation, 

E[ /c(P)cost(P)] = Y^ P[/i(P)]cost(i?) < A • cost(5'). 
BeS' BeS' 

Now by Markov’s inequality, 

Pr[ Y^ ^(P)cost(i?) > 3A • cost(5')] < 1/3. 

BeS' 


It follows that, 

r 

Pr[^ cost (opt (P/, |5'i|)) > (1 + 3A)cost(5')] 

i=l 

< Pr[cost(5') + Y^ fi{B)cost{B) > (1 + 3A)cost(5')] 
BeS' 

— Br[ ^ ^(P)cost(P) > 3A • cost(5')] < 1/3. 


Since A = e/ 6 P, (1 + 3A)^ < 1 + e. Thus we conclude that with probability at least 1 — 
CLUSTERING(X, k) returns a solution with at most (1 + e)k balls whose cost is at most 
(1 + £)cost(opt(A, k)). 

Now consider the time complexity of the algorithm. CLUSTERING(P, k) makes 
direct recursive calls on subsets of diameter at most diam(P)/2. Thus the overall time 
complexity of CLUSTERING (A,/c) can be bounded by Plugging in A = e/6L, 

7 = clogn/A, and 0 ( 7 ) = 7 “, we conclude 
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► Theorem 8. There is a randomized algorithm for k-clustering that runs in time ^^ 

and with probability at least 1 — ^ returns a solution with at most (1 + e)k balls whose cost 
is at most (1 + e) times the optimal. Here L is 1 plus the logarithm of the aspect ratio of X, 
that is, the ratio of the maximum and minimum interpoint distances in the set X. 

I 5 I Inapproximability Result 

In this section we present an inapproximability result which complements the result in 
Section 3. In particular here we consider the case when a is not a constant. The heart of 
this result is a reduction from the dominating set problem. Given a graph G = {V,E), a 
dominating set for G is a subset V of V such that for any vertex a € V \ V, a is connected 
to at least one vertex of V by an edge in E. The dominating set problem is defined as follows. 

Dominating Set Problem (DSP) 

INSTANCE: Graph G = {V,E), positive integer k < \V\. 

QUESTION: Is there a dominating set for G of size at most fc? 

The following inapproximability result is proved by Kann [20]. 

► Theorem 9. There is a constant c > 0 such that there is no polynomial-time clog |U|-/actor 
approximation algorithm for DSP assuming V ^ NV. 

The following theorem shows an inapproximability bound for MCC when a > log jXj. 

► Theorem 10. For a > log jXj, no polynomial time algorithm for MCC can achieve an 
approximation factor better than clog |A| assuming V ^ NV. 

Proof. To prove this theorem we show a reduction from DSP. Given an instance (G = 
{V,E),k) of DSP we construct an instance of MCC. The instance of MCC consists of 
two sets of points X (clients) and Y (servers), and a metric d defined on A U U. Let 
V = {vi,V 2 ,. •., Vn}, where n = jUj. For each Vi gV, Y contains a point yi and X contains 
a point Xi- For any point p G X U Y, d{p,p) = 0. For i,j G [n], d{xi,yj) is 1 if z = j or 
the edge {vi,Vj) G E, and d{xi,yj) is 3 otherwise. For i,j G [n] such that i ^ j, we set 
d{xi,Xj) = d{yi,yj) = 2 . 

Consider two nonadjacent vertices Vi and Vj. For any xt G X such that t ^ i,j, 
d{xi,xt) + d{xt,yj) > 3. Similarly, for any yt GY such that t ^ i,j, d{xi,yt) + d{yt,yj) > 3. 
Thus d defines a metric. Next we will prove that G has a dominating set of size at most k iff 
the cost of covering the points in X using the balls around the points in Y is at most k. 

Suppose G has a dominating set J of size at most k. For each vertex vj G J, build a 
radius 1 ball around yj. We return this set of balls B as the solution of MCC. Now consider 
any point Xi G X. If Vi G J, then Xi is covered by the ball around yi. Otherwise, there must 
be a vertex Vj £ J such that {vi,Vj) G E. Then d{xi,yj) is 1 and Xi is covered by the ball 
around yj. Hence S is a valid solution of MCC with cost at most k. 

Now suppose there is a solution B of MCC with cost at most k. If k > |A|, then U is a 
dominating set for G of size |A| < k. If k < |A|, our claim is that the radius of each ball in 
is 1. Suppose one of the balls B has a radius more than 1. Then the way the instance of 
MCC is created the radius should be at least 3. Hence fc > 3“ > > |A|, which is a 

contradiction. Now consider the set of vertices J corresponding to the centers of balls in B. 
It is not hard to see that J is a dominating set for G of size at most k. 

Let OPT be the cost of any optimal solution of MCC for the instance (A, Y, d). Then by 
the properties of this reduction the size of any minimum dominating set for G is OPT. Thus if 
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there is an approximation algorithm for MCC that gives a solution with cost (clog |X|)-OPT, 
then using the reduction we can produce a dominating set of size (clog |l^|)-OPT. Then from 
Theorem 9 it follows that V = NV. This completes the proof of our theorem. ◄ 
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Conclusions 


One generalization of the MCC problem that has been studied [9, 8] includes fixed costs for 
opening the servers. As input, we are given two point sets X (clients) and Y (servers), a 
metric on Z = A U 1", and a facility cost fy>0 for each server y G Y. The goal is to find a 
subset Y' C Y, and a set of balls {By \y gY' and By is centered at y} that covers A, so as 
to minimize J^y^Y'ifv + It is not hard to see that our approach generalizes in a 

straightforward way to give a (1 + e) approximation to this problem using quasi-polynomial 
running time. To keep the exposition clear, we have focussed on the MCC rather than this 
generalization. 

The main open problem that emerges from our work is whether there one can obtain a 
(1 -|- £)-approximation for the fc-clustering problem in quasi-polynomial time. 
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